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A/-glycosylation is one of the most common and complex post-translational modifications of 
eukaryotic proteins and one that has numerous roles, such as modulating protein stability, 
sorting, folding, enzyme activity, and ligand interactions. In plants, the functional signif- 
icance of /V-glycosylation is typically obscure, although it is a feature of most secreted 
proteins and so is potentially of considerable interest to plant cell wall biologists. While 
analytical pipelines have been established to characterize yeast, mammalian, and bacte- 
rial A/-glycoproteomes, such large-scale approaches for the study of plant glycoproteins 
have yet to be reported. Indeed, the A/-glycans that decorate plant and mammalian or 
yeast proteins are structurally distinct and so modification of existing analytical approaches 
are needed to tackle plant /V-glycoproteomes. In this review, we summarize a range 
of existing technologies for large-scale A/-glycoprotein analysis and highlight promising 
future approaches that may provide a better understanding of the plant A/-glycoproteome, 
and therefore the cell wall proteome and other proteins associated with the secretory 
pathway. 
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INTRODUCTION 

Characterization of the detailed structures of N-glycoproteins can 
provide valuable insights into basics aspects of cell and organismal 
biology and is also becoming increasingly important for pharma- 
ceutical development and production (De Marchis etal., 2011; 
Xu etal., 2011; Yang etal., 2012). In plants, N-glycosylation is 
a common feature of secreted proteins and there is consider- 
able interest in developing more advanced analytical platforms 
to characterize N-glycosylation in order to provide new insights 
in the plant cell wall proteome (Minic et al., 2007; Marino et al., 
2010; Zhang et al, 2010; Catala et al, 201 1; Ruiz-May et al, 2012). 
However, structural elucidation of glycans, glycopeptides, and 
glycoproteins is notoriously difficult as glycans are structurally 
complex, being branched, and containing a variety of types of 
intersaccharide linkages, and heterogeneous, with a wide range 
of abundance between glycoforms and occupancy of glycosylated 
sites. Ideally, a comprehensive analysis of a plant glycoprotein 
involves the identification of glycosylated peptides, determination 
of the location of the glycosylation sites, and elucidation of the 
glycan structure (Fitchette etal., 2007), each of which presents a 
specific set of technical challenges. 

In the past decade, biological mass spectrometry (MS) has 
become established as a core analytical tool for proteome pro- 
filing (Thelen and Peck, 2007; Yates etal, 2009; Ytterberg and 
Jensen, 2010; Song etal., 2011) as well as characterization of 
protein glycosylation patterns and glycan analysis (Harvey, 2005; 
Park and Lebrilla, 2005). However, while advanced tandem MS 
(MS/MS) technologies provide an effective means for glycopep- 
tide amino acid sequencing and obtaining structural information 



from oligosaccharides (e.g., sequence, linkages, and branching), 
the high degree of complexity and variability in glycoprotein 
populations from any eukaryotic source still poses significant 
analytical obstacles. Sample preparation that allows enrichment 
of the targeted glycoproteins/glycopeptides is therefore often an 
important step as it increases the concentration of the desired 
analyte while reducing sample complexity. A second technical hur- 
dle is the hydrophilic nature of glycans and their relatively larger 
masses compared with the native tryptic peptides to which they are 
attached. This often results in poor retention on the reverse phase 
chromatographic materials that are typically used in peptide sepa- 
rations, reduced ionization efficiency and larger than optimal mlz 
detection range in MS analysis, all of which can contribute to poor 
sensitivity (An etal, 2009; Ytterberg and Jensen, 2010). Further- 
more, the labile nature of the glycan-peptide bond often causes 
the neutral loss of individual carbohydrates and few fragmented 
ions from the peptide during the collision induced dissociation 
(CID), yielding little or no information for peptide identification, 
glycol-site determination, and full glycan sequence. As a result, 
deglycosylation is often required and the free peptide and gly- 
can are subjected separately to tandem MS (MS/MS) analysis. 
Finally, due to the high glycan heterogeneity, determination of 
the glycan structure through bioinformatics tool-based database 
searching is often difficult and requires time-consuming manual 
interpretation. 

To overcome these analytical challenges, many new strate- 
gies have been developed and implemented over the past decade 
involving the development of three major tools: (1) selective 
enrichment technologies for glycoproteins, glycopeptides, and 



www.frontiersin.org 



July 2012 | Volume 3 | Article 150 | 1 



Ruiz-May etal. 



Characterization of the plant W-glycoproteome 



released glycans, and improvements in separation science includ- 
ing the miniaturization of chromatographic formats, capillary 
electrophoresis (CE), and ion mobility technologies; (2) enhanced 
MS technology and development of more effective analysis work- 
flows; and (3) newly developed software packages and algorithms 
for interpretation of MS fragmentation spectra. Most of these 
technologies have been initially developed to study mammalian 
glycoproteins and there are only a few reported examples of 
their successful application to characterize plant N-glycosylation 
(Fitchette etal, 2007; Bardor etal, 2009; Zhang etal., 2012), 
which is structurally distinct from that of mammalian or yeast 
proteins (Ruiz-May etal., 2012). Key structural differences, such 
as the absence of non-reducing terminal sialic acid in plant 
N-glycans and the presence of a-l,3-fucose (Fuc) in the inner- 
most GlcNAc residue necessitate tailoring of existing analytical 
workflows. 

In this review we present some of the analytical platforms 
or strategies, including new developments in MS instrumenta- 
tion, which can be used for the systematic characterization of 
N-glycoproteins, with particular reference to the challenges of 
studying those from plants, as well as a perspective of future 
developments in the field. To our knowledge there are no pub- 
lished reports describing large- or even intermediate-scale analysis 
of plant protein glycosylation together with associated N-glycans 
analysis. However, given the new and emerging analytical pipelines 
we are soon likely to see a dramatic expansion in the identifica- 
tion and functional characterization of plant N-glycoproteomes, 
which in turn will be of great value for researchers studying the 
plant cell wall and associated secretory pathways. 

GLYCOPROTEIN DETECTION 

A summary of a typical workflow for characterizing N- 
glycoproteins/larger scale N-glycoproteomes is shown in Figure 1. 
A valuable first step is to confirm the presence of glycoproteins in 
a crude extract using ID or 2D gel-based electrophoretic sepa- 
ration, followed by immunoblot analysis or direct visualization 
using a commercially available sugar detection stain (Fitchette 
et al., 2007; Bardor et al., 2009). Antibodies to horseradish peroxi- 
dase (HRP) can recognize glycoproteins with an epitope associated 
with pi— >2 xylose and/or al— >3 fucose structures on complex- 
type N-glycans (Fitchette et al, 2007; Bardor et al, 2009; Figure 2). 
They are therefore useful for detecting plant N-glycoproteins since 
plant complex- type glycans carry f5 1^-2 xylose and/or al— >3 core 
fucose structures (Strasser et al., 2008). This is a relatively straight- 
forward technique, but does not distinguish between different 
types of N-glycans, or between N- and O-linked glycans. The 
presence of glycoproteins can also be suggested by the presence 
of smeared bands on SDS-PAGE gels (Zhang etal, 2010; Catala 
etal, 2011), resulting from the presence of heterogenous glycans. 
Another practical, but sometimes less informative means to assess 
the presence of glycoproteins in a sample is by comparison of 
changes in the electrophoretic mobility of the constituent pro- 
teins following specific deglycosylation steps (Fitchette et al., 2007; 
Bardor etal, 2009). It is important to note that that the enzyme 
PNGase-F, which is typically use to deglycosylate mammalian pro- 
teins does not process N-glycans that contain an a-l,3-fucose 
residue that is a common component of plant N-glycans, although 



enzymes such as PNGase-A can be used as an alternative (Ytterberg 
and Jensen, 2010). 

GLYC0PR0TEIN/GLYC0PEPTIDE ENRICHMENT 
AND PREPARATION 

Depending on the protein complexity in the initial samples, the 
analysis of glycoproteins then often proceeds with the enrich- 
ment of glycoproteins or glycopeptides by specific lectins or 
immuno affinity chromatography (Rudiger and Gabius, 2001) to 
remove non-glycosylated forms. Lectin chromatography using 
Concanavalin A (Con A) has been reported for the enrich- 
ment of N-glycoproteins from diverse sources including human 
serum, Caenorhabditis elegans plasma membrane glycoproteins 
(Bunkenborg etal, 2004; Fan etal., 2004), and plant tissues 
(Minic etal, 2007; Zhang etal., 2010; Catala etal, 2011). Vari- 
ous plants lectins are available with affinities for different sugars 
(Gabius etal., 2004) and such lectins have been used sequen- 
tially (Yamamoto etal, 1995, 1998), in parallel (Yang etal, 
2006; Lee etal, 2009; Zielinska etal., 2010), and as mixtures 
(Li etal, 2004) to increase the range of identified glycoproteins. 
When working with limited amounts of sample it is often nec- 
essary to miniaturize the chromatography. Both silica-based and 
monolithic materials incorporating a lectin-containing station- 
ary phase are now commercially available that can be packed 
into micro or capillary columns, greatly increasing the num- 
ber of glycoproteins that can be detected in low concentrations 
compared with standard column formats (Madera etal, 2006; 
Feng etal, 2009). 

Once an enriched glycoprotein sample has been prepared, a 
typical next step is to fragment the parent protein backbone and a 
potential technical challenge is that the glycan moiety may spatially 
interfere with proteolysis, particularly near the glycosylation site. 
This can result in missed cleavages, yielding glycosylated polypep- 
tides that are too large to be detected by some MS instruments. 
However, the use of less specific proteases, such as pepsin, ther- 
molysin, Proteinase K, either alone or in combination, can reduce 
this problem (Chen et al., 2009). Nevertheless, no matter what the 
digestion procedure, the degree of heterogeneity of the resulting 
mixture of peptides and glycopeptides can complicate the subse- 
quent MS analysis as glycopeptides often ionize less efficiently than 
non-glycopeptides and may be less abundant than their unmodi- 
fied counterparts. Therefore, in some cases, deglycosylation prior 
to or concurrent with enzymatic digestion is desirable (see below). 
If deglycosylation is not employed, an additional enrichment 
of glycopeptides following glycoprotein digestion is often imple- 
mented to improve specificity. An alternative chemical approach 
for glycoprotein/glycopeptides enrichment is the use of hydrazine 
resin. N-glycans bind covalently to this matrix after oxidation with 
periodic acid and can then be released with PNGase A, although 
some information related to the N-glycan structure is lost (Zhang 
etal., 2003). Other alternative chemical methods involve graphi- 
tized carbon (Larsen etal, 2005) or boronic acid-functionalized 
beads, since boric acid is able to form diesters with glycans that 
have ds-diol groups (Sparbier etal., 2005). 

Several such enrichment methods have been successfully 
applied in large-scale N-glycoproteome analyses, such as the 
combined use of lectin (Con A) affinity chromatography and 
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FIGURE 1 | A schematic overview of experimental approaches 
for the systematic characterization of glycoproteins by mass 
spectrometry-based proteomic analysis. Regardless of the sample 
sources, the general workflow includes three major steps: enrichment of 



glycoproteins, glycopeptides, and released glycans; mass spectrometry 
analysis of glycopeptides, glycans, and peptides; and interpretation of MS 
fragmentation spectra, allowing identification of core amino acid sequences 
of the glycopeptides, glycosylation sites, and potential glycan structures. 
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FIGURE 2 | Organ-specific immunodetection of glycan epitopes. 

Protein extracts from different tomato organs were fractionated 12% 
polyacrylamide gels and visualized by SYPRO Ruby protein staining 



(A) or transferred onto nitrocellulose membrane for immunoblotting 
using antisera that recognize p-1 ,2-xylose (B) or a-1 ,3-fucose 
(C) residues. 
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subsequent hydrophilic interaction chromatography (HILIC) 
enrichment of tryptic glycopeptides (Calvano etal., 2008; 
Selby etal., 2008), and size-exclusion chromatography has also 
been used to enrichment for tryptic glycopeptides (Alvarez- 
Manilla etal, 2006). 

MS ANALYSIS 

Once enriched glycopeptide samples have been isolated, they are 
then commonly subjected to MS/MS analysis, or to a deglyco- 
sylation step for simplifying MS analysis of individual peptides 
and glycans. However, to overcome the relatively poor sensitivity 
resulting from the hydrophilic nature of glycans, online nano-scale 
LC enrichment coupled with nano-electrospray ionization tandem 
MS (nanoESI-MS/MS; Bahr et al., 1997; Sandra et al., 2004; Zhang 
and Van Pelt, 2004) or additional derivatization (Zaia, 2004) is 
often used to improve glycopeptide ionization efficiency. Alterna- 
tively, matrix assisted laser desorption/ionization (MALDI) MS is 
used extensively for direct analysis of the released glycan classes 
or followed by glycan derivatization, as has been reviewed in 
detail (Zaia, 2010; Antonopoulos etal, 2011). Nano-scale liq- 
uid chromatography electrospray ionization (nanoLC-ESI) allows 
for low concentration of analytes, improves ionization efficiency 
and allows for low flow rates, which in turn makes it possi- 
ble to use extended analysis times for MS/MS and multi-stage 
fragmentation (MS n ) oligosaccharide analyses (Bahr etal., 1997). 
Traditionally, low-energy CID MS/MS has been the method of 
choice for peptide and glycopeptides sequencing operating in 
either automated data-dependent acquisition mode, or manual 
selection for MS/MS through infusion analysis. Another unique 
feature of CID MS/MS analysis of glycopeptides or glycans is 
the production of oxonium ions, such as mlz 163 (Hex + H) + , 
mlz 204 (HexNAc + H)+, and mlz 366 (Hex-HexNAc + H)+ , 
which can be used as diagnostic marker ions, indicating the pres- 
ence of specific sugar compositions. The oxonium ions are often 
monitored in precursor ion scanning mode for highly selective 
detection of glycopeptides in digestion mixture and subjected to 
MS/MS and MS/MS/MS of the selectively detected glycopeptides 
ions, yielding complete peptide and glycan sequences (Sandra 
etal, 2004; Zhang and Williamson, 2005). It should be note that 
highly selective precursor ion scanning for sugar oxonium ions 
also allows direct analysis of relatively less complex protein digests 
without further enrichment of glycopeptides (Sandra etal., 2004; 
Zhang and Williamson, 2005). 

However, low-energy CID MS/MS can be rendered less effec- 
tive by the labile nature of the glycosidic bond and often fails to 
produce fragment ions that can be used to determine both the 
peptide sequence and glycosylation site. In this regard, two new 
dissociation technologies, electron capture dissociation (ECD; 
Zubarev et al., 1998) and electron transfer dissociation (ETD; Syka 
etal., 2004), an analogue of ECD, provide valuable alternative 
approaches. Both ECD and ETD are non-ergodic fragmenta- 
tion techniques that induce backbone fragmentation at N-Cct 
bonds and create complementary c- and z-type ion series, yielding 
information that is highly complementary to conventional CID 
fragmentation (Zubarev etal., 1998; Syka etal., 2004; Coon etal., 
2005). ETD MS is becoming more widely accessible as it enables 
the incorporation of ECD-type fragmentation in more common 



RF ion trap mass spectrometers, thus eliminating the need for 
highly sophisticated higher-cost FTICR MS instruments. ETD 
cleaves the peptide backbone in a sequence-independent manner, 
leaving the glycan preserved on the peptide and is increasingly 
recognized as an important alternative dissociation technique to 
CID for glycosylation analysis (Catalina et al., 2007; Khidekel et al., 
2007; Wuhrer etal, 2007b). With the combination of CID and 
ETD fragmentation available in single ion trap instruments, it is 
now possible to analyze intact glycopeptides and determine both 
peptide and glycan sequence information (Wuhrer etal., 2007a). 
To some degree the development of these non-ergodic fragmen- 
tation methods, which can be used to structurally characterize 
much larger analytes, has reduced the need to ensure that the 
proteolytic digestion proceeds to the point where the mass con- 
tribution of the peptidyl moiety is minimal. Wu etal. (2007) 
have successfully demonstrated the use of endoproteinase Lys-C 
to study the glycosylation of several model proteins using a com- 
bination of ETD and CID. This approach reduces the number 
of non-glycosylated peptides, thereby facilitating the characteri- 
zation of the glycopeptides and reducing the need for additional 
chromatographic steps. 

The development of ion mobility spectrometry (IMS) has 
extended the ability to characterize glycoprotein/glycopeptides to 
the realm of isomeric and conformational structure (Plasencia 
et al., 2008). For example, Olivova et al. (2008), using the high res- 
olution provided by a recently developed quadrupole ion-mobility 
time-of-flight instrument, was able to determine the total glyco- 
sylation profile of a monoclonal antibody. The IMS function was 
used to resolve the light and heavy chains in the gas phase allowing 
the accurate mass measurement of each. Furthermore, the inno- 
vative dual-collision cell design of this instrument was leveraged 
in a two step fragmentation process to provide a detailed charac- 
terization of the glycan structures and the determination of the 
glycosylation site. 

Other than direct MS analysis of enriched glycopeptides, deg- 
lycosylation steps can also be performed to gain additional infor- 
mation about peptide sequence and complete glycan structure. 
Three enzymes are commonly used for enzymatic deglycosylation. 
PNGase F, an amidase that hydrolyzes the amide bond between the 
Asn residue on the peptide backbone and the innermost HexNAc 
unit, is active on high mannose and complex plant N-glycans, 
except those with an ot-l,3-fucose residue linked to the inner- 
most HexNAc (Fitchette etal, 2007; Bardor etal, 2009). In this 
case, PNGase A can be used to release glycans from relatively 
short glycopeptides (Fitchette etal., 2007; Bardor etal, 2009). 
Alternatively, endoglycosidase H (Endo H) cleaves the glycosidic 
bond between the two GlcNAc residues on the high-mannose- 
type N-glycans of plant glycoproteins (Harvey, 2005). While these 
deglycosylation steps are time consuming it has been shown that 
pressure cycling and exposure to microwaves can greatly reduce 
deglycosylation times (Chen etal., 2009). Chemical deglycosy- 
lation can also be used, although enzymatic deglycosylation is 
generally favored as chemical approaches often introduce unex- 
pected modifications of peptide side chains (Fitchette etal., 2007; 
Bardor etal, 2009). 

After deglycosylation, LC-MS/MS is used to identify the amino 
acid sequence of the deglycosylated peptide. Alternatively, the 
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released glycans can be further purified and analyzed by MALDI- 
TOF or LC-ESI-MS/MS analysis to identify the glycan structure 
(An etal., 2009). In practice, MS/MS spectra derived from the 
glycosidic bond fragment ions, along with cross-ring fragmen- 
tation of glycans and treatments with different exo-hexosidases 
are required for complete structural elucidation (Zaia, 2004). In 
the past, per- derivation, such as permethylation, of glycans has 
often been used to improve sensitivity and has yielded more struc- 
turally relevant fragment ions during MS/MS. However, as new 
MS technologies are constantly being developed, the sensitivity 
improvements provided by derivatization has only been marginal 
and insufficient to compensate for the more complex sample 
preparation and the fact that derivatization makes it difficult to 
employ glycosidase treatment to determine linkages and identify 
saccharide units (Kang et al., 2007). Another technique, called iso- 
tope coded glycosylation site-specific tagging (IGOT), has been 
used for the large-scale identification of N-linked glycoproteins 
from complex biological samples of C. elegans (Kaji et al., 2003, 
2007) and mouse (Zielinska etal., 2010). This approach involves 
protein tryptic digestion and enrichment of resulting glycopep- 
tides followed by conversion of glycosylated Asn residues into 
ls O-labeled aspartic acids by PNGase F digestion in ls O water. 
The subsequent 3 Da mass difference created by this process 
is readily identified by LC-MS/MS analysis, indicating N-linked 
glycosylation sites. However, a potential pitfall of this approach 
is that a significantly high false positive rate (~60%) has been 
reported (Angel etal., 2007), apparently due to the fact that the 
trypsin used for proteolysis was still active, leading to the incor- 
poration of ls O into the C-termini of the peptides during the 
deglycosylation step. 

Capillary electrophoresis has been used to analyze the het- 
erogeneity of isolated glycoproteins with MS detection (Balaguer 
and Neususs, 2006; Thakur etal, 2009; Szabo etal, 2010) and 
the incorporation of microfluidic devices can also greatly facil- 
itate the analysis by increasing sensitivity and reducing analysis 
times (Zhuang etal, 2007, 2011). While these CE/MS methods 
have proven useful as a means to characterize glycoprotein hetero- 
geneity and to investigate the structure of glycans at the N- and 
O-glycosylation sites of recombinant glycoproteins (Zaia, 2010), 
they have not yet proven sufficiently robust for the analysis of 
complex glycoprotein mixtures. 

QUANTITATIVE /V-GLYC0PR0TEIN ANALYSIS BY MS 

In principle quantitative glycoproteomics experiments maybe car- 
ried out by the same assortment of techniques that have been 
developed and are currently in use for labeling and label-free 
approaches in other proteomics disciplines. There have been sev- 
eral excellent reviews (Bindschedler and Cramer, 2011; Matros 
etal., 2011, Pan etal, 2011) and the reader is referred to these 
for detailed descriptions of the various experimental strategies 
employed and methodologies in use. 

Historically, the quantification of proteoglycans has been dif- 
ficult as it involves both the structural elucidation of glycan and 
peptide moieties. While structural characterization of peptides 
is well developed and is facilitated by the existence of complete 
genomic databases for a growing number of species, the struc- 
tural characterization of the glycan moiety remains a challenge as 



they are often exceedingly heterogeneous and chemically complex, 
with differing chemical configurations and abundant isoforms. 
Furthermore, glycans are frequently branched and contain an 
assortment of glycan linkages (Zaia, 2004). These complications 
combine in such a way that quantitative experiments must be car- 
ried out in different tiers with each providing a different level 
of quantitative information. The first tier involves determining 
glycosylation sites and their degree of occupancy. For N-linked 
glycans potential glycosylation sites can be recognized due to 
the presence of the consensus sequence Asn-X-Ser/Thr using 
any of several open access programs such as Net Glyc (http:// 
www.cbs.dtu.dk/services/NetOGlyc/). However, not all potential 
sites are occupied and those that are occupied are often only par- 
tially so but by a variety of differing glycans. Determining the num- 
ber and types of glycoforms and their relative proportion present 
at each site represents the second tier of quantifications. The third 
tier of glycoprotein quantification involves comparing the state of 
glycosylation between different samples, i.e., control verses treated, 
control verses diseased, etc. Of the methods available, only MS 
allows for quantitative characterization at all three tiers. 

Two-dimensional gel electrophoresis with glycan-specific stain- 
ing has been the traditional method used to identify and quantify 
glycoproteins (Bardor etal., 2009). When coupled with MS of 
tryptic digests of excised spots it can provide quantitative char- 
acterization on all three levels. However, the heterogeneity and 
structural complexity of the glycans has important consequences 
on a protein's mobility which complicate the interpretation of 
the electropherograms. Thus, LC-MS has emerged as the preem- 
inent tool for the identification and quantitative characterization 
of glycoproteins and glycopeptides. The development of novel 
hybrid mass spectrometers incorporating ion mobility cells, high 
mass accuracy and resolution, improved sensitivity, and scanning 
capabilities has allowed detailed glycoprotein structural charac- 
terization, with accurate empirical information concerning the 
position and degree of occupancy of the glycosylation sites as well 
as the composition, structure, and distribution of glycoforms. 

In the discovery phase of research, most quantitative MS strate- 
gies involve either a labeling approach (Gygi et al, 1999; Kaji et al, 
2003; Zhang etal, 2003; Ross etal, 2004; Aggarwal etal, 2006; 
Atwood etal., 2008; Haqqani etal., 2008) or a label-free strategy 
(Hill etal, 2009; Rebecchi etal, 2009; Zhang etal, 2012). These 
non-targeted approaches allow one to quickly survey complex pro- 
teomes to determine the features that vary between samples and 
more precise quantitative information can be obtained through a 
targeted approach involving specific reaction monitoring (SRM) 
as recently described by Zhang etal. (2012). New developments 
involving data independent acquisition (DIA) using approaches 
such as MS E or SWATH (Levin et al, 201 1; Gillet et al, 2012; Hopf- 
gartner etal., 2012) could potentially simplify these procedures 
further by allowing the discovery and targeted data to be collected 
simultaneously greatly reducing the instrument time required. 

BI0INF0RMATIC TOOLS 

One of the biggest challenges in the field of glycoproteomics 
is the development of bioinformatic tools for glycan analysis. 
As a consequence of the complex fragmentation behaviors of 
glycopeptides and glycans, which tend to vary under different CID 



www.frontiersin.org 



July 2012 | Volume 3 I Article 150 I 5 



Ruiz-May etal. 



Characterization of the plant W-glycoproteome 



conditions utilized by various MS instruments, and the high level 
of glycan heterogeneity, there are few tools for direct large-scale 
identification of glycoproteins through database searching. As a 
result, tedious manual data interpretation represents the major 
bottle-neck for glycoprotein characterization. However, several 
computer programs have been developed, such as GlysodelQ 
(Joshi etal, 2004), GlyMod (Cooper etal., 2001), GlycoMiner 
(Ozohanics et al, 2008), and Peptoonist (Goldberg et al, 2007), to 
aid identification of intact N-linked glycopeptides and oligosac- 
charides, as well as CartoonistTwo (Goldberg etal, 2006) for 
automatic determination of the topology of O-glycan without 
linkage information from MS/MS spectra. Nevertheless, there 
is an unquestionable need for improved bioinformatic tools to 
keep pace with the rapid advances in MS technologies and strate- 
gies for sample preparation, such as software for the automatic 
determination of N-linked glycopeptides. 

DISCUSSION AND PERSPECTIVES 

As described above, the characterization of the plant N- 
glycoproteome is methodologically challenging due to the extreme 
structural heterogeneity and the often low concentration of sev- 
eral glycoforms attached to each glycopeptide residing in the total 
peptide pool. However, several approaches have been established 
to study specific population of glycoproteins from different tis- 
sues. Until now, there had not been a single enrichment approach 
that permits the analysis of the complete population of glycopro- 
tein in a specific tissue, but lectin affinity chromatography is an 
effective option because it permits the purification and analysis of 
native glycoproteins, allowing the characterization of endogenous 
glycoforms without any modification of the N-glycans structure. 
Moreover, the availability of several lectins with different ligand 
affinities allows broader coverage of the N-glycoproteome, as has 
been described in mammals (Zielinska etal, 2010). 



As far as we are aware, there have been no such reports 
describing the application of large-scale enrichment techniques 
in combination with cutting-edge MS/MS methods for the direct 
analysis of intact plant glycopeptides to gain information of both 
glycan and peptide moiety. However, the establishment of new 
MS platforms, such as the use of CID and ETD fragmentation in a 
single ion trap instrument for simultaneous analysis of the sugar 
and protein components of glycopeptides (Wuhrer etal., 2007a) 
is likely to revolutionize the study of plant glycoproteins. Rapid 
progress will depend on other technical advances such as solving 
the problems of the recalcitrance of plant proteins to deglycosy- 
lation by standard PNGase F treatments and the development of 
new, robust bioinformatics tools. 

Taken together, such developments hold great promise for 
plant researchers in many fields, including those interested in the 
plant cell wall. Increasing numbers of wall localized proteins are 
being identified (Ruiz-May etal., 2012) and yet remarkably little 
is known about the functional significance of post-translational 
modifications such as glycosylation. New analytical pipelines such 
as those described here, perhaps coupled with other techniques 
that will likely result in glycoprotein enrichment (e.g., Parsons 
etal., 2012), hold great potential in this regard. This represents 
an important new frontier in plant cell wall biology and it can be 
argued that the advent of the plant N-glycoproteome starts now. 
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